Abstract

We address the question of why particular laws were selected for the universe, 
by proposing a mechanism for laws to evolve. Normally in physical theories, time- 
less laws act on time-evolving states. We propose that this is an approximation, good 
on time scales shorter than cosmological scales, beyond which laws and states are 
merged into a single entity that evolves in time. Furthermore the approximate dis- 
tinction between laws and states, when it does emerge, is dependent on the initial 
conditions. These ideas are illustrated in a simple matrix model. 
1 Introduction

Physics has for most of its history been primarily concerned with finding out what the 
laws of nature are. While we still do not have a completely unified theory of physics, 
our understanding of the laws of nature has advanced to the point where we are not 
only interested in what the laws are, but why these are the laws, and not others. This 
problem has become urgent since the discovery of the landscape of string theories.
The hope that a theory that unifies gravity and the standard model of particle physics 
would be unique, in a way that leads to unique predictions for beyond the standard model 
physics, seems difficult to sustain in the face of a vast or infinite number of apparently 
equally consistent string vacua. Even if one is not confident that string theory is the right 
framework for unification, no framework has appeared which would answer the why 
these laws question. 

The realization that we would sooner or later have to explain how and why the laws 
we observe governing our universe were chosen is not new. The issue was emphasized 
by John Wheeler[5], but the concern is much older and goes back to Leibniz's Principle of
Sufficient Reason. As the American pragmatist philosopher Charles Sanders Peirce wrote
in 1893, nothing is so much in need of rational explanation as the laws of nature. Peirce goes on
to say,

To suppose universal laws of nature capable of being apprehended by the mind and 
yet having no reason for their special forms, but standing inexplicable and irrational, 
is hardly a justifiable position. Uniformities are precisely the sort of facts that need to 
be accounted for. Law is par excellence the thing that wants a reason. Now the only 
possible way of accounting for the laws of nature, and for uniformity in general, is to 
suppose them results of evolution. 

In contemporary work, all the present attempts to understand how laws may have 
been chosen from a landscape of possible laws evoke, in one way or another, the no- 
tion that the effective low energy laws change on cosmological time scales. This includes 
eternal inflation[7] and cosmological natural selection[8]. It is notable that both require a
notion of time to give meaning to the evolution of effective laws. In cosmological natural
selection a time is required to count generations and give sense to an ensemble of universes
on the landscape at a fixed time; in eternal inflation it appears necessary to impose
a measure which is related to a notion of time on the multiverse.

The view that laws must evolve in a physically real, non-emergent time, if there is to
be a scientific explanation of why these laws hold, has been developed in work in progress
with Roberto Mangabeira Unger[1]. This note explores a suggestion made there, which
is that the evolution of laws implies a breakdown of the distinction between law and
state. Another way to say this is that there is an enlarged notion of state, a metastate,
which codes the information needed to specify both an effective law and the effective state
that the effective law acts on. The whole metastate evolves in time, and the distinction
between effective law and effective state can only be made for certain time scales. How 
long these time scales are, as well as the effective laws, are determined by the initial 
metastate. The effective laws evolve with the state, but they evolve slowly, compared to 
other information captured by the state. 

Hence, on short time scales, and to a certain approximation, one can distinguish a 
slowly varying effective law which generates faster evolution of an approximate state. 
On longer time scales the more precise picture is that there is a notion of a meta-state, 
which codes both the effective law and the effective state. 

The purpose of this paper is to present a simple matrix model where this idea is realized.
But in realizing this idea we have to confront an issue that arises in any scenario
in which laws of physics evolve. In both cosmological natural selection[8] and eternal
inflation[7] there is posited a dynamical mechanism whereby a population of regions of
the universe with different laws evolves, giving rise to an evolving distribution on a landscape,
or space of laws, L [3]. The evolution of laws on the landscape is then driven by a
metalaw. Even if not precisely specified, this metalaw becomes a key part of the explanation
of why these laws.

In the case of cosmological natural selection, the metalaw is approximate and effective 
and involves small random changes in the parameters of the standard model. This is 
analogous to an effective dynamics for evolution of phylogeny in biology. In the case 
of eternal inflation the metalaw is tunneling from false vacua with amplitudes given by 
string theory. 

However, these scenarios have a weak point. The postulation of metadynamics on 
the landscape is a scientific hypothesis. How is it to be justified? If there is no principle 
which determines the law, it is not likely there will be a principle which determines the 
metalaw. And how is the proposed metalaw to be tested? One can easily imagine differ- 
ent hypotheses for the action of metalaws on the spaces of observable parameters. How 
are these to be compared, when we see in our past at best one instance of the metalaws 
acting? Someone may claim that the evolution on the landscape is driven by some funda- 
mental version of string theory. Someone else may claim the evolution on the landscape 
is fundamentally stochastic (and why not, since so is quantum theory?) and driven only by a
simple set of rules. How are we to determine scientifically which is right?

Worse, we may have to postulate some metalandscape of metalaws on which a meta- 
meta-law acts to govern the choice of the metalaw. There is clearly a danger of an infinite 
regress here. 

On the other hand, if one does not specify a metalaw one explains nothing. We call 
this the metalaws dilemma.

It is important to have a precise idea of what is going wrong when we encounter this 
kind of dilemma. Normally in physics we specify a theory in two steps. First we specify 
the configuration space or phase space, C, which is a timeless space of possible configurations
a system may have at one time. Then we specify the laws of motion, which generate
the possible lawful trajectories of the system on C. If we append to this the landscape of 
possible laws we have two timeless configuration spaces: that of configurations and that 
of laws. 

This formulation of laws of nature can be called the Newtonian paradigm because it 
is the basic framework of laws of motion introduced by Newton[1]. The Newtonian
paradigm is also the framework for modern quantum mechanics, quantum field theory, 
and general relativity. In each case there is a timeless space of states acted on by a timeless 
law. 

The Newtonian paradigm is the proper setting for most of physics, which concerns 
small subsystems of the universe. But when we attempt to scale it up to a description of 
the universe as a whole it leads to unanswerable questions such as why these laws and not 
others and what caused the initial conditions. No theory formulated within the Newto- 
nian paradigm can answer these questions because it takes the laws and initial conditions 
as inputs. When we attempt to invent a theory of evolution on a landscape of theories, 
but stay within the Newtonian paradigm, we end up with puzzles and paradoxes. 

Part of the problem is the following. The Newtonian paradigm is based on a strict sep- 
aration of the roles of law and initial conditions. This is justified by the fact that we can 
operationally distinguish the influence of the choice of laws from the choice of initial con- 
ditions, by doing experiments many times varying the initial conditions. Operationally, 
what we mean by a law is some feature of the evolution which is invariant or conserved 
when we vary the initial conditions. So the experimental context that gives meaning to 
theories formulated in the Newtonian paradigm is the study of small subsystems of the 
universe, where we can repeat an experiment as many times as needed. 

In cosmology there is only a single history, so we lose the ability to do an experiment
over and over again, while varying the initial conditions. So we have no operational 
way to absolutely distinguish the influence of the choice of laws from the choice of initial 
conditions. When we attempt to impose the Newtonian paradigm on the interpretation 
of cosmological data, and ask questions that assume a strict separation between the role 
of law and the role of initial conditions, we end up asking confused questions that have 
no clear answers. 

We call this running into the cosmological fallacy, which is the mistake of extending
a method that is designed to study small subsystems of the universe that come in many
copies to the universe as a whole. To usefully apply a theory in the Newtonian paradigm
to a system we require data from many repetitions of an experiment to give operational
meaning to its basic terms, and in particular, to separate out the role of laws from initial
conditions. But in the cosmological case, the data do not allow that distinction to be
made^1.

It is probably wiser to not impose a paradigm for dynamical law on the cosmological 
data that is based on a distinction that cannot be made within that data. That is, once we 
lose the ability to distinguish the roles of law and initial conditions in the data, because
we have just one case in cosmology, we are probably going to make more progress if we 
search for a framework for physical theory that does not rely on the distinction between 
law and initial condition being absolute. 

What is then needed is a new paradigm for dynamics on a cosmological scale. In 
this new framework, the absolute distinction between laws and states, or laws and initial 
conditions, which underlies the Newtonian paradigm can be transcended. That distinc- 
tion will be seen to be an artifact of descriptions of small subsystems of the universe, and 
breaks down on cosmological time scales. The challenge is to introduce such a framework 
without falling into a vicious circle or the metalaws dilemma. The purpose of this paper 
is to explore one possible form that such a new approach to cosmological dynamics may 
take. 

In previous work[9], I proposed a possible resolution to these conundra, which is that
there could be a notion of universality of metalaws, analogous to universality in the the- 
ory of computation. The idea is that any metalaw which could serve as such is equivalent 
to any other. Computation is universal because any computer can emulate any other ex- 
actly. The proposal in [9] is that any metalaw worthy of that name can emulate any other, 
because they will lead to the same predictions for the evolution of laws. In [9], I made a
proposal for such a universal metalaw in the context of a matrix model. 

In this note, I propose a model for another approach to the metalaws dilemma, which 
is that the distinction between states and laws breaks down. This new proposal is also 
realized in a simple matrix model. Instead of timeless law determining evolution on a 
timeless space of states, we have a single evolution which cannot be precisely broken 
down into law and state. Formally, what this means is to embed the configuration space
of states, C, and the landscape parameterizing laws, L, into a single meta-configuration
space, M. The distinction between law and state must then be both approximate and
dependent on initial conditions. 

There is, it must be granted, an evolution rule on M, but we can choose an evolution
rule that is almost entirely fixed by some natural assumptions. The remaining freedom
is, I conjecture, accounted for by the principle of universality, which I just described. Because
the complexity of the effective law is now coded into the state, the metalaw can be
very simple: all it has to do is generate a sequence of matrices in which the differences
from one to the next are small. The metalaws dilemma is addressed by showing
that the form of this rule is almost completely fixed by some natural assumptions, with
the remaining freedom plausibly accounted for by universality.

1 The multiverse seems at first to be a way to avoid this, because it makes our universe one out of many
and so appears to reproduce the context needed to make sense of the separation between law and initial
condition. This, however, cannot succeed so long as we have no data about the other universes, because
there is still no operational basis for the distinction between law and initial conditions. The only exception
is the special case where each universe in the ensemble shares a property; then one can check the theory by
seeing if our universe has that property. This is the strategy of cosmological natural selection. It also leads
to a single prediction for eternal inflation, which is that all universes in the multiverse have k = -1.

In this model of a metatheory, the metastate is captured in a large matrix, $X$, which we
take to be antisymmetric and valued in the integers. It might describe a labeled graph.
The metalaw is a simple algorithm that yields a sequence of matrices, $X_n$. The rule is
that $X_n$ is obtained by adding to a linear combination of $X_{n-1}$ and $X_{n-2}$ their commutator
$[X_{n-1}, X_{n-2}]$. Given the first two matrices, $X_0$ and $X_1$, the sequence is determined. This is
more like a simple instruction in computer science than a law of physics, and we are able
to argue it is almost unique, given a few simple conditions.

That almost unique evolution rule acts on a configuration space of matrices, whose
interpretation depends on a separation of time scales. For certain initial configurations
there will be a long time scale, $T_{\text{Newton}}$, such that, for times shorter than $T_{\text{Newton}}$, the dynamics
can be approximately described by a fixed law acting on a fixed space of states. Both
that law and that state are coded into the $X_n$. But for longer times everything evolves,
laws and states together, and it is impossible to cleanly separate what part of the evolution
is changes in law and what part is changes in state. Furthermore, which information
in M evolves slowly, and goes into the specification of the approximate time independent
law, and which evolves fast, and goes into the description of the time dependent state, is
determined by the initial conditions.

So the question of "why these laws" becomes subsumed into the question of "why 
these initial conditions" in a metatheory. This does not yet solve the problem of explain- 
ing the particular features of the standard model and its parameters, but it gives a new 
methodology and strategy with which to search for the answer. 

Starting from the standard model, one might move in the direction of a metatheory 
by elevating all parameters to degrees of freedom. This is something like what happens
in the string landscape. Here we make a simple model in which the meta-state is a large 
sparse matrix, perhaps representing the connections on a graph. 

In the next section we describe a simple model which illustrates these ideas and show
how it leads, for short time scales, to an approximate distinction between an effective law
and the state whose evolution it governs.

2 A minimal evolution rule 

We are interested in the most minimal evolution rule we can imagine which combines the 
theory and the state. Let us specify the meta-state by an $N \times N$ antisymmetric matrix of
integers, $(X_n)_{ab} = -(X_n)_{ba}$. We will consider the dimension $N$ to be large. The $n$ refers
to a succession of times, $n = 0, 1, 2, \ldots$, also labeled by integers. $(X_n)_{ab}$ might be taken to
describe the adjacency matrix of a weighted, directed graph, whose edges are labeled by
integers. This accords with the expectation that the fundamental variables in physics be 
relational. 

The idea is that there will be an evolution rule which specifies the series of matrices, 
given initial choices. The choice of this evolution rule is fixed by the following ideas. 

1. The evolution rule should mimic second order differential equations, as these are
basic to the dynamics of physical systems. So two initial conditions should be required
to generate the evolution. We should then need to specify $X_0$ and $X_1$ to
generate the sequence. We are then interested in rules of the form

$$X_n = \mathcal{F}(X_{n-1}, X_{n-2}) \tag{1}$$



2. The changes should be small from matrix to matrix, at least given suitable initial
conditions. This is needed so that there can be a long time scale on which some of
the information in the matrices is slowly varying. This makes it possible to extract
a notion of slowly varying law, acting on a faster varying state. So we will ask that
two equal successive matrices reproduce themselves,

$$X = \mathcal{F}(X, X) \tag{2}$$



3. We require that the evolution rule be non-linear, because non-linear laws are needed
to code interactions in physics. But we can always use the basic trick of matrix models
of introducing auxiliary variables, by expanding the matrix, in order to lower the
degree of non-linearity. This accords with the fact that the field equations of general
relativity and Yang-Mills theory can, by the use of auxiliary variables, be expressed
as quadratic equations^2. The simplest non-linear evolution rule will then suffice,
so we require a quadratic evolution rule.

4. Time reversal invariance, at least at the linear level. 



A simple evolution rule that realizes these is

$$X_n = 2X_{n-1} - X_{n-2} + [X_{n-1}, X_{n-2}] \tag{3}$$



This rule is not unique, but it is nearly so. It is easy to derive the general rule satisfying 
the four requirements just mentioned. 

The rule (1) can only have a linear term and a quadratic term. The quadratic term must
be a function of $X_{n-1}$ and $X_{n-2}$ that vanishes when they are equal and is antisymmetric.
The unique term that does this is the commutator $[X_{n-1}, X_{n-2}]$. When the commutator
vanishes there is only a linear term, which by (2) must be equal to an integer linear combination
of $X_{n-1}$ and $X_{n-2}$ whose coefficients sum to one. The general evolution rule satisfying the
first three requirements is then

$$X_n = a X_{n-1} + (1 - a) X_{n-2} + g [X_{n-1}, X_{n-2}] \tag{4}$$

where $a$ and $g$ must be integers to keep the entries of $X_n$ integers.

2 As, for example, in the Plebanski action.

We pick $a = 2$ and $g = 1$ to get (3). The justification for the choice of the linear term is
time reversal invariance. With this choice (3) can be written as

$$\Delta^2 X_n = X_n + X_{n-2} - 2X_{n-1} = [X_{n-1}, X_{n-2}] \tag{5}$$

The linear term, $\Delta^2 X_n$, is invariant under a time reversal transformation around the time
$n - 1$, given by

$$X_{n-1+a} \leftrightarrow X_{n-1-a} \tag{6}$$

under which $\Delta^2 X_n \to \Delta^2 X_n$. Here $a$ denotes an integer offset in time, not the coefficient in (4).

The whole dynamics is approximately invariant under a related transformation, $X_{n-1+a} \leftrightarrow -X_{n-1-a}$.
An exactly time reversal invariant version of the dynamics would be $X_n = 2X_{n-1} - X_{n-2} + [X_n, X_{n-2}]$,
but this is much harder to evolve.
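
As an illustration, the rule (3) can be iterated directly on small integer matrices. The following is a minimal sketch in plain Python and is not taken from the paper: the helper names (`commutator`, `step`, `evolve`, `random_antisymmetric`), the matrix size, and the choice of entries in {-1, 0, 1} are assumptions made only for this example. It checks two properties used in the text: antisymmetry is preserved by the rule, and two equal initial matrices give a constant sequence, as required by (2).

```python
import random

def matmul(a, b):
    """Product of two square integer matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(a, b):
    """[a, b] = ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[p - q for p, q in zip(ra, rb)] for ra, rb in zip(ab, ba)]

def step(x_prev, x_prev2):
    """One application of rule (3): 2 X_{n-1} - X_{n-2} + [X_{n-1}, X_{n-2}]."""
    c = commutator(x_prev, x_prev2)
    n = len(x_prev)
    return [[2 * x_prev[i][j] - x_prev2[i][j] + c[i][j] for j in range(n)]
            for i in range(n)]

def evolve(x0, x1, steps):
    """Return [X_0, X_1, ..., X_{steps+1}] generated from two initial matrices."""
    seq = [x0, x1]
    for _ in range(steps):
        seq.append(step(seq[-1], seq[-2]))
    return seq

def random_antisymmetric(n, rng, entries=(-1, 0, 1)):
    """Hypothetical initial data: antisymmetric matrix with small integer entries."""
    x = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            v = rng.choice(entries)
            x[i][j], x[j][i] = v, -v
    return x

def is_antisymmetric(x):
    n = len(x)
    return all(x[i][j] == -x[j][i] for i in range(n) for j in range(n))

rng = random.Random(0)
x0 = random_antisymmetric(6, rng)
x1 = random_antisymmetric(6, rng)
seq = evolve(x0, x1, 4)             # X_0 ... X_5
constant = evolve(x0, x0, 3)        # equal initial matrices
```

Because Python integers have arbitrary precision, the entries stay exact even though the quadratic term makes them grow quickly for generic initial data.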

Here is a way to understand how state and law are combined under this evolution
rule. Let us define the "Hamiltonian at time n" to be

$$H_n = X_{n-2} \tag{7}$$

and define the "state at time n" to be

$$\rho_n = X_n - X_{n-1} \tag{8}$$

We can define the rate of change of the state as

$$\Delta \rho_n = \rho_n - \rho_{n-1} = \Delta^2 X_n \tag{9}$$

Then the evolution rule (3) is expressed as

$$\Delta \rho_n = [\rho_{n-1}, H_n] \tag{10}$$

Thus it appears that the matrix we call $H_n$ is generating evolution on the state called $\rho$.
Another equivalent way to express the evolution is

$$\Delta^2 \rho_n = [\Delta \rho_{n-1}, X_{n-2}] \tag{11}$$
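
The rewriting of the rule in terms of a state and a Hamiltonian can be checked numerically. The sketch below is illustrative only: the two 3 x 3 initial matrices are arbitrary choices, the helper names are hypothetical, and the second difference is assumed to mean $\Delta^2 \rho_n = \Delta \rho_n - \Delta \rho_{n-1}$, which the text does not spell out. Under that assumption it confirms that a sequence generated by (3) satisfies (10) and (11) at every step.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def sub(a, b):
    return [[p - q for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]

def commutator(a, b):
    return sub(matmul(a, b), matmul(b, a))

def evolve(x0, x1, steps):
    """Iterate rule (3): X_n = 2 X_{n-1} - X_{n-2} + [X_{n-1}, X_{n-2}]."""
    seq = [x0, x1]
    for _ in range(steps):
        a, b = seq[-1], seq[-2]
        c = commutator(a, b)
        seq.append([[2 * a[i][j] - b[i][j] + c[i][j] for j in range(len(a))]
                    for i in range(len(a))])
    return seq

# Arbitrary antisymmetric integer initial data (an assumption of this example).
x0 = [[0, 1, 0], [-1, 0, 2], [0, -2, 0]]
x1 = [[0, 1, 1], [-1, 0, 2], [-1, -2, 0]]
xs = evolve(x0, x1, 5)                                   # X_0 ... X_6

# State (8) and its first difference (9), indexed by the time n.
rho = {n: sub(xs[n], xs[n - 1]) for n in range(1, len(xs))}
d_rho = {n: sub(rho[n], rho[n - 1]) for n in range(2, len(xs))}

# Equation (10): Delta rho_n = [rho_{n-1}, H_n] with H_n = X_{n-2}.
holds_10 = all(d_rho[n] == commutator(rho[n - 1], xs[n - 2])
               for n in range(2, len(xs)))

# Equation (11): Delta^2 rho_n = [Delta rho_{n-1}, X_{n-2}].
holds_11 = all(sub(d_rho[n], d_rho[n - 1]) == commutator(d_rho[n - 1], xs[n - 2])
               for n in range(3, len(xs)))
```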



2.1 Quasi-Hamiltonian evolution 

Equations (9, 10) hold at all time steps. But this is not really Heisenberg evolution because
the operator we are calling the Hamiltonian evolves as the state evolves. But, as I will now
show, if we choose the initial conditions so that $\rho$ is in a certain sense small compared to $H$,
then $H$ evolves more slowly than $\rho$, and so for short times it appears as if the state
is evolving with respect to a fixed Hamiltonian, so that (8, 10) are, for a finite time, well
approximated by a Heisenberg-like equation of motion,

$$\Delta \rho_n = [\rho_{n-1}, H_0]. \tag{12}$$

To show this we introduce a norm on matrices, $\|X\|$, which is equal to the number of
non-zero entries. Then, if $p(X)$ is the probability that a matrix element is non-zero,

$$p(X) = \frac{\|X\|}{N^2} \tag{13}$$

Pick an arbitrary time and call it $n = 0$. Call

$$X_0 = H_0, \qquad \rho_1 = \Delta \tag{14}$$

where $\Delta$ without an argument denotes a matrix, the initial state, rather than the difference operator.
Then define

$$\Delta^{(1)} = [\Delta, H_0], \quad \Delta^{(2)} = [\Delta^{(1)}, H_0], \quad \ldots, \quad \Delta^{(p)} = [\Delta^{(p-1)}, H_0] \tag{15}$$

2.2 The first steps 

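Let us follow the first few steps of evolution

$$
\begin{aligned}
X_0 &= H_0, & \rho_1 &= \Delta \\
X_1 &= H_0 + \Delta, & \rho_2 &= \Delta + \Delta^{(1)} \\
X_2 &= H_0 + 2\Delta + \Delta^{(1)}, & \rho_3 &= \Delta + 2\Delta^{(1)} + \Delta^{(2)} + [\Delta^{(1)}, \Delta] \\
X_3 &= H_0 + 3\Delta + 3\Delta^{(1)} + \Delta^{(2)} + [\Delta^{(1)}, \Delta], & \rho_4 &= \Delta + 3\Delta^{(1)} + 3\Delta^{(2)} + \Delta^{(3)} + [\Delta^{(1)}, \Delta] + [\Delta^{(2)}, \Delta] \\
 & & &\quad + [\Delta + 2\Delta^{(1)} + \Delta^{(2)} + [\Delta^{(1)}, \Delta],\, 2\Delta + \Delta^{(1)}] \\
X_4 &= H_0 + 4\Delta + 6\Delta^{(1)} + 4\Delta^{(2)} + \Delta^{(3)} + 2[\Delta^{(1)}, \Delta] \\
 &\quad + [\Delta^{(2)}, \Delta] + [\Delta + 2\Delta^{(1)} + \Delta^{(2)} + [\Delta^{(1)}, \Delta],\, 2\Delta + \Delta^{(1)}]
\end{aligned}
\tag{16}
$$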

Clearly terms are rapidly proliferating. To make sense of them, begin by noting that
there are two kinds of terms in the $\rho_n$'s. First there are terms that involve single powers
of $\Delta^{(p)}$. These come from commutators with $H_0$ and can be considered to be the effect of
evolution with a fixed Hamiltonian, $H_0$. Then there are terms involving commutators of
two or more $\Delta^{(p)}$. These register the effect of the changing evolution law. As we will now
show, there are natural choices of $H_0$ and $\Delta$ such that the latter remain unimportant for a
large number of time steps.
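
The split between the two kinds of terms can be made concrete by evolving the same initial data twice: once with the exact rule (3), and once with the Hamiltonian frozen at $H_0$ as in (12). The sketch below is a toy check under stated assumptions: the 4 x 4 matrices `h0` and `delta` are arbitrary examples, and `combine` is a hypothetical helper. At the third step the two evolutions differ by exactly the commutator $[\Delta^{(1)}, \Delta]$, the first term that registers the changing law.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[p - q for p, q in zip(ra, rb)] for ra, rb in zip(ab, ba)]

def combine(*terms):
    """Integer linear combination: combine((c1, A1), (c2, A2), ...)."""
    n = len(terms[0][1])
    return [[sum(c * m[i][j] for c, m in terms) for j in range(n)]
            for i in range(n)]

# Arbitrary antisymmetric example data: H_0 is a chain graph, Delta adds one link.
h0 = [[0, 1, 0, 0], [-1, 0, 1, 0], [0, -1, 0, 1], [0, 0, -1, 0]]
delta = [[0, 0, 0, 1], [0, 0, 0, 0], [0, 0, 0, 0], [-1, 0, 0, 0]]

# Exact evolution with rule (3), starting from X_0 = H_0 and X_1 = H_0 + Delta.
x_exact = [h0, combine((1, h0), (1, delta))]
for _ in range(2):
    a, b = x_exact[-1], x_exact[-2]
    x_exact.append(combine((2, a), (-1, b), (1, commutator(a, b))))

# Frozen-law evolution (12): rho_n = rho_{n-1} + [rho_{n-1}, H_0].
x_frozen = [h0]
rho = delta
for _ in range(3):
    x_frozen.append(combine((1, x_frozen[-1]), (1, rho)))
    rho = combine((1, rho), (1, commutator(rho, h0)))

d1 = commutator(delta, h0)          # Delta^(1)
d2 = commutator(d1, h0)             # Delta^(2)
closed_x3 = combine((1, h0), (3, delta), (3, d1), (1, d2),
                    (1, commutator(d1, delta)))
law_drift = combine((1, x_exact[3]), (-1, x_frozen[3]))
```

With the law frozen, the coefficients of the $\Delta^{(p)}$ are binomial (1, 3, 3, 1 at the third step), which is the pattern visible in the terms of (16) that contain a single $\Delta^{(p)}$.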

2.3 Norms and probabilities 

Let us pick $X_0 = H_0$ to be a random matrix chosen from the ensemble with

$$p(H_0) = \frac{1}{N} \tag{17}$$

so that it corresponds to the critical region in random graph theory of a graph which is
minimally connected. Then

$$\|H_0\| = N \tag{18}$$

We will pick $\Delta$ to have a norm of order unity, so that $X_1$ differs from $X_0 = H_0$ by just a
few entries or links. Then

$$\|\Delta\| = M \in O(1) \quad \text{so that} \quad p(\Delta) \approx \frac{M}{N^2} \tag{19}$$

Let us assume that there are no further correlations between $H_0$ and $\Delta$ so that,

$$p(\Delta^{(1)}) = N p(\Delta) p(H_0) \approx \frac{M}{N^2} \tag{20}$$

so that

$$\|\Delta^{(1)}\| \approx M \tag{21}$$

It then follows that all the

$$\|\Delta^{(p)}\| \approx M \tag{22}$$

Notice that because these $\Delta^{(p)}$ are so sparse

$$p([\Delta^{(1)}, \Delta]) = p([\Delta^{(p)}, \Delta^{(q)}]) = N p(\Delta)^2 = \frac{M^2}{N^3} \tag{23}$$

Hence the norm of these commutators is

$$\|[\Delta^{(p)}, \Delta^{(q)}]\| = \frac{M^2}{N} \ll 1 \tag{24}$$

This means that there are no entries in most of these commutators.
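
To get a feeling for the sizes involved, the estimates (17) to (24) can be evaluated for sample values of $N$ and $M$. The following sketch only restates those formulas with exact rational arithmetic; the function name `sparsity_estimates`, the dictionary keys, and the sample values $N = 1000$, $M = 3$ are assumptions of this example, and the norm is taken to be $N^2 p(X)$, which is how (17) and (18) relate.

```python
from fractions import Fraction

def sparsity_estimates(N, M):
    """Evaluate the probability and norm estimates of this section for given N and M."""
    p_h0 = Fraction(1, N)                 # (17): critical random graph
    p_delta = Fraction(M, N ** 2)         # (19): Delta has M non-zero entries
    p_delta1 = N * p_delta * p_h0         # (20): one commutator with H_0
    p_comm = N * p_delta ** 2             # (23): commutator of two sparse Delta^(p)

    def norm(p):
        # Expected number of non-zero entries among the N^2 matrix elements.
        return N ** 2 * p

    return {
        "norm_H0": norm(p_h0),            # (18)
        "norm_Delta": norm(p_delta),      # (19)
        "norm_Delta1": norm(p_delta1),    # (21)
        "norm_comm": norm(p_comm),        # (24)
    }

est = sparsity_estimates(N=1000, M=3)
```

For these sample values the expected number of non-zero entries in a commutator $[\Delta^{(p)}, \Delta^{(q)}]$ is 9/1000, so a typical such commutator is the zero matrix, while $H_0$ has about 1000 entries and each $\Delta^{(p)}$ about 3.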

2.4 Breakdown of the distinction between law and state 

Now after $n$ evolution steps we will have a time dependent Hamiltonian $H_n = X_{n-2}$ of
the form

$$H_n = H_0 + \delta H_n \tag{25}$$

where the time dependent part has the form,

$$\delta H_n = \delta H_n(M) + \delta H_n(M^2) + \ldots \tag{26}$$

where $(M^p)$ signifies the terms of order $M^p$. The leading term collects terms of order $M$,
which come from commutators of the form $[\Delta^{(p)}, H_0]$:

$$\delta H_n(M) = \sum_{p=1}^{n-3} c_p \Delta^{(p)} \tag{27}$$

Here the $c_p$ are integer coefficients.

Any one of the terms likely has no effect on the evolution of the $\rho$'s, but there are
$n$ of them, so $\delta H_n$ will start to be significant when $n$ is large enough. Hence, ignoring the
$O(M^2)$ terms, we have

$$p(\delta H_n(M)) = \frac{nM}{N^2} \tag{28}$$

These will be negligible compared to the terms in $H_0$ if

$$\frac{p(\delta H_n(M))}{p(H_0)} = \frac{nM}{N} < 1 \tag{29}$$

so the approximation in which we neglect the terms in $\delta H_n(M)$ is good so long as

$$n < \frac{N}{M} \tag{30}$$

We can reach the same conclusion by computing the ratio of $p(\Delta^{(p)})$ to $p([\Delta^{(p)}, \Delta^{(q)}])$.

Similarly we can compute the importance of the order $M^2$ terms in $\delta H_n$. These come
from commutators of the form $[\Delta^{(p)}, \Delta^{(q)}]$. Any one of these is most likely vanishing, but
there are $n^2$ of them. We have,

$$p(\delta H_n(M^2)) = n^2 N \left( \frac{M}{N^2} \right)^2 = \frac{n^2 M^2}{N^3} \tag{31}$$

These can be neglected relative to the entries in $H_0$ so long as

$$\frac{p(\delta H_n(M^2))}{p(H_0)} \approx \frac{n^2 M^2}{N^2} < 1 \tag{32}$$

which leads us to the same condition (30). Indeed, the order $M^q$ term in $\delta H_n$ comes from
$q - 1$ commutators of factors $\Delta^{(p)}$ for $p < q$, so these have probabilities

$$p(\delta H_n(M^q)) = n^q N^{q-1} \left( \frac{M}{N^2} \right)^q = \frac{n^q M^q}{N^{q+1}} \tag{33}$$

These are each negligible compared to the matrix elements of $H_0$ so long as (30) holds.
One can also show this for the sum (26). Assuming the matrices are random so that the
commutators are uncorrelated in the limit of large $N$ we can write,

$$p(\delta H_n) = p(\delta H_n(M)) + p(\delta H_n(M^2)) + \ldots = \frac{1}{N} \sum_{q \geq 1} \left( \frac{nM}{N} \right)^q = \frac{1}{N} \, \frac{nM/N}{1 - nM/N} \tag{34}$$

Hence

$$\frac{p(\delta H_n)}{p(H_0)} = \frac{nM/N}{1 - nM/N} \tag{35}$$

which is small so long as (30) holds. This means that the Hamiltonian evolution law (12)
is a good approximation to the exact dynamics so long as (30) holds.
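
The final estimate can be turned into a small calculator for how far the effective law has drifted after $n$ steps. This is only a sketch of the closed form (35) and of the geometric series behind it: the function names `law_drift_ratio` and `partial_sum` and the sample values are assumptions of this example, and the model itself fixes no particular $N$ or $M$.

```python
from fractions import Fraction

def law_drift_ratio(n, N, M):
    """Closed form (35): p(delta H_n) / p(H_0) = x / (1 - x) with x = n M / N."""
    x = Fraction(n * M, N)
    if x >= 1:
        # Outside condition (30) the series does not converge and the
        # split into a fixed law plus a state is no longer meaningful.
        raise ValueError("estimate only valid for n < N/M")
    return x / (1 - x)

def partial_sum(n, N, M, q_max):
    """First q_max terms of the series sum_q (n M / N)^q, each term taken from (33)."""
    x = Fraction(n * M, N)
    return sum(x ** q for q in range(1, q_max + 1))

ratio = law_drift_ratio(n=10, N=1000, M=2)         # x = 1/50
approx = partial_sum(n=10, N=1000, M=2, q_max=5)   # leading five orders in M
```

For $N = 1000$ and $M = 2$ the bound (30) reads $n < 500$. After 10 steps the ratio is 1/49, about two percent, and it is dominated by the order $M$ term; the higher orders only matter as $n$ approaches $N/M$.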

3 Conclusions 

In the introduction of this paper we argued that a cosmological theory must be formulated
in a way in which the usual distinction between dynamics and state, or between
kinematics and dynamics, breaks down[1]. In the rest of this paper we illustrated these
ideas with a simple toy model. In it we addressed the problem of what determines the 
meta-law by which effective laws evolve by specifying four simple properties that almost 
completely determine it. We conjecture that the remaining freedom is unimportant, be- 
cause there may be a principle of universality among the remaining choices, in the sense 
that the predictions made by each of them can be mapped to each other. 

There remain, of course, open questions, among which is to demonstrate this conjecture
of universality.